Reconstructing hv-Convex Polyominoes 
from Orthogonal Projections 

Marek Chrobak^ 1 Christoph Diirr b 

Q\ . ^Department of Computer Science, University of California, 

Os ' Riverside, CA 92521-0304. marek@cs.ucr.edu 

^ , International Computer Science Institute, 1947 Center Street, Suite 600, 

Berkeley, CA 94704-1198. cduerr@icsi.berkeley.edu 



(N 

in 

q 

o 



Key words: Combinatorial problems, discrete tomography, polyominoes. 



1 Introduction 



Tomography is the area of reconstructing objects from projections. In discrete 
tomography an object T we wish to reconstruct may be a set of cells of a 
multidimensional grid. We perform measurements of T, each one involving a 

q\ . projection that determines the number of cells in T on all lines parallel to the 

projection's direction. Given a finite number of such measurements, we wish 
to reconstruct T or, if unique reconstruction is not possible, to compute any 
object consistent with these projections. Gardner et al. [7] proved that deciding 

^ if there is an object consistent with given measurements is NP-complete, even 

for three non-parallel projections in the 2D grid. 

In this paper we address the case of orthogonal (horizontal and vertical) pro- 
jections of a 2D grid. A given object T, defined as a set of cells in a m x n 
grid, can be identified with anmxn 0-1 matrix, where the l's determine the 
cells of T. We will use all three notations: set-theoretic, integer and boolean, 
whichever is most appropriate in a given context. 

The row and column sums of an object T are defined by rowsumi(T) = J2j Tij 
for i = 1, . . . ,m and colsum,j(T) = J2i.Tij for j = 1, ... ,n. The vectors 
rowsum(T) and colsum(T) represent the horizontal and vertical projections of 
T. (See Figure 1.) 
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Given two integer vectors r = (ri, . . . ,r m ) and c = 
(ci, . . . , c n ), a realization of (r, c) is an object T whose 
row and column sum vectors are r and c, that is: A 
rowsum(T) = r and colsum(T) = c. In the reconstruc- 
tion problem, given (r, c), we wish to find a realization 
T of (r, c), or to report failure if such T does not ex- 
ist. The corresponding decision problem is called the 
consistency problem. 
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Fig. 1: An object with 
its row and column 



sums. 



Properties of 0-1 matrices with given row and column 
sums have been studied in the literature since 1950's. 
We refer the reader to the work of Ryser [9], or to an excellent survey by 
Brualdi [4]. Ryser presents an 0(mn) algorithm for the reconstruction prob- 
lem. In fact, the realizations constructed by his algorithm can be represented 
and computed in time 0(m + n). 

For many objects, the orthogonal projections do not provide sufficient infor- 
mation for unique reconstruction. In this case, additional information about 
the object's structure could either lead to a unique realization, or at least 
substantially reduce the number of alternative solutions. Some research has 
been done on polyominoes , which are connected objects. More formally, we 
associate with an object T a graph, whose vertices are the cells of T, and 
edges join adjacent cells: ((i,j), (i'jf)) is an edge iff \i — i'\ + \j — j'\ < 1. If 
this graph is connected, then T is called a polyomino. Woeginger [10] proved 
that the consistency problem for polyominoes is NP-complete. 

Some geometric properties of polyominoes have been studied too. Call an 
object T horizontally convex (or h-convex) if the cells in each row i of T are 
consecutive, that is, if (i,k),(i,l) G T then (i,j) G T for all k < j < I. 
Vertically convex (v-convex) objects are defined analogously. The consistency 
problem for h-convex objects (whether we require that they are polyominoes 
or not) is also known to be NP-complete [2]. 

If T is both h-convex and v-convex, then we say that T is hv-convex. Kuba [8] 
initiated the study of hv-convex polyominoes and proposed a reconstruction 
algorithm with exponential worst-case time complexity. Quite surprisingly, as 
shown later by Barcucci et al [2], the reconstruction problem for hv-convex 
polyominoes can be solved in polynomial time. The algorithm given in [2] 
is, however, rather slow; its time complexity is 0{m A n i ). In another paper, 
Barcucci et al. [3] showed that certain "median" cells of (r, c) must belong to 
any hv-convex polyomino realization, and, using this result, they developed 
an 0(m 2 n 2 ) -time heuristic algorithm. This new algorithm is not guaranteed 
to correctly reconstruct an object, although the experiments reported in [3] 
indicate that for randomly chosen inputs this method is very unlikely to fail. 
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The main contribution of this paper is an 0(mnmin(m 2 ,n 2 ))-time algorithm 
for reconstructing hv-convex polyominoes. Our algorithm is several orders of 
magnitude faster than the best previously known algorithm from [2], and is 
also much simpler than the algorithms from [2,3,8]. In addition, we address a 
special case of centered hv-convex polyominoes, in which r, = n for some i. In 
other words, at least one row is completely filled with cells. For this case we 
show that the reconstruction problem can be solved in time 0(m + n). 



2 hv-Convex Polyominoes 



Throughout the paper, without loss of generality, we assume that r, c denote 
strictly positive row and column sum vectors. We also assume that J2i r i — 
J2jCj, since otherwise (r, c) do not have a realization. 

An object A is called an upper-left corner region if (i+l,j) G A or (i,j+l) G A 
implies (i,j) G A. In an analogous fashion we can define other corner regions. 
By T we denote the complement of T. The definition of hv-convex polyominoes 
directly implies the following lemma. 

Lemma 1 T is an hv-convex poly omino if and only ifT = AUBUCUD, where 
A, B, C , D are disjoint corner regions (upper-left, upper-right, lower-left and 
lower-right, respectively) such that (i) (i — l,j — l)EA implies (i,j) ^ D, and 
(ii) (i — 1, j + 1) G B implies (i,j) ^ C . 

Given an hv-convex polyomino T and two row indices 1 < k, I < m, we say 
that T is anchored at (k,l) if (k,l),(l,n) G T. The idea of our algorithm 
is, given (r, c), to construct a 2SAT expression (a boolean expression in con- 
junctive normal form with at most two literals in each clause) F^r, c) with 
the property that i*fc,j(r, c) is satisfiable iff there is an hv-convex polyomino 
realization T of (r, c) that is anchored at (k,l). i*fc,j(r, c) consists of several 
sets of clauses, each set expressing a certain property: "Corners" (Cor), "Dis- 
jointness" (Dis), "Connectivity" (Con), "Anchors" (Anc), "Lower bound on 
column sums" (LBC) and "Upper bound on row sums" (UBR). 

, , a i Aij =>■ Ai^ij Bij =>■ ijj-ij Cij =>■ Gj+ij Djj =>■ Di + ij 

Dis = f\ [X i:j => Y id : for symbols X, Y G {A, £, C, D}, X ^ Y } 

Con = /\ {A id => A+ij+i ^j => Ci+ij.i} 

,4nc = Z fc ,i A S fci i A C fc ,i A D fc ,i A A, n A Si n A Ci „ A Di n 
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77 op A I /\i<min{fc,Z} A,j ^ ^ij+ri l\k<i<l ^i,j ^ ^i,j+ri 

l\l<i<k Aij => Dij +n /\max{k,l}<i ^ i,j ^ ^i,j+ri 

Define F M (r,c) = Cor A £>zs A Con A ,4nc A L5C A C/RR. All literals with 
indices outside the set {1, . . . , m\ x {1, . . . , n} are assumed to have value 1. 

Algorithm 1 

Input: reN m ,c£ N n 

W.l.o.g assume: Vi : r\ G [l,n], Vj : c,- G [l,ra], Si r i = Z)j c j an d rn <n. 

For k, I = 1, ... ,m do begin 

If -Ffc,z(r, c) is satisfiable, 

then output T = iU5UCUD and halt, 
end 
output "failure". 

Theorem 2 i^(r, c) zs satisfiable if and only if (r, c) /iawe a realization T 
that is an hv-convex polyomino anchored at (k,l). 

Proof (<=) If T is an hv-convex polyomino realization of (r, c) anchored at 
(k,l), then let A,B,C,D be the corner regions from Lemma 1. By Lemma 1, 
A, B, C and D satisfy conditions (Cor), (Dis) and (Con). Condition (Anc) is 
true because T is anchored at (k,l). (LBC) and (UBR) hold because T is a 
realization of (r, c). 

(=») If F kt i(r,c) is satisfiable, take T = AUBUCUD. Conditions (Cor), 
(Dis) and (Con) imply that the sets A, B, C, D satisfy Lemma 1, and thus T 
is an hv-convex polyomino. Also, by (Anc), T is anchored at (k,l). 

It remains to show that T is a realization of (r,c). Conditions (UBR) and 
(LBC) imply that rowsunii(T) < Ti for each row i, and colsunij(T) > Cj for 
each column j. Thus 



5I r * — X] rowsum i(T) = X] colsunij(T) > y^ 



■j- 



Since J2t r % = J2j Cj, T must be a realization of (r, c), and the proof is complete. 

□ 

Each formula i*fc,/(r, c ) has size 0(mn) and can be computed in time 0(mn). 
Since 2SAT can be solved in linear time [1,6], we obtain the following result. 

Theorem 3 Algorithm 1 solves the reconstruction problem for hv-convex poly- 
ominoes in time 0(mnmm(m 2 ,n 2 )). 
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An implementation (see [5]) of Algorithm 1 can be made more efficient by 
reducing the number of choices for k, I. First, restrict k to multiples of c±, and / 
to multiples of c n . Second, let mi be the largest index for which r\ < • • • < r mi , 
and let m 2 be the smallest index for which r m2 > ■ ■ ■ > r m . It is easy to see 
that we can assume that min{A;,/} < mi and max {k,l} > 1712 (see [3].) 
Analogously we define column indices n± and n 2 . With these restrictions on 
k, I, we run the 2SAT algorithm only 0(mm(mim2/ciC n , nin 2 /rir m )) times. 

One can also remove redundant clauses from formulas -Ffc,/(r, c). For example, 
in the clauses (Dis), we only need to state that regions AnB = and CflD = 0. 
The other disjointness relations follow from condition (LBC). Finally, it is not 
necessary to reconstruct each -Ffc,«( r ? c ) from scratch, since only clauses (UBR) 
and (Anc) depend on k and I. 



3 Centered hv-Convex Polyominoes 



Structure of realizations. A rectangle R is the intersection of a set of 
rows and a set of columns, say R = I x J, for / C {1, . . . ,m} and J C 
{1, . . . , n}. The rectangle orthogonal to R is R 1 - = I x J, where / and J are 
the complements of J, J. 

The lemma below follows from the work of Ryser [9] (see also [4]). We include 
a simple proof for completeness. 

Lemma 4 Let R = I x J be a rectangle such that J2i^i r i — Y^jijCj-\-\I\ ■ \ J\. 
Then each realization T of (r, c) satisfies R C T and R 1 - fl T = 0. 

Proof If T is a realization of (r, c) then 

l^nT|-|^nT| = EE^-EE^ = Y.Y.t^-Y.Y.t^ 

iei jeJ i0 j^j iei j i j^j 

= J2 r i-J2 c j = l 7 H J l> 

iei j£j 

and therefore R C T and R L n T = 0. □ 

Representation of hv-convex realizations. Since we are interested in 
centered hv-convex realizations, we assume now that rj, = n, for some row k. 
We could also assume that the row sums satisfy r± < ■ ■ ■ < r^ and r& > • • • > 
r m , although our algorithm does not use this assumption. 

We represent an hv-convex realization as a vector T = (t 1} . . . ,t m ), where 
each ti G [l,ra — r, + 1] is the smallest j for which (2, j) e T. Thus (2, j) G 
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T is equivalent to U < j < U + r«. By definition, such realizations satisfy 
the horizontal convexity and the row-sum conditions. The vertical convexity 
condition can be written as 

U+5 <U< t i+s + r i+s - ri (VC) 

where 5 = 1 for i < k and — 1 for i > k. The column sums can be expressed 
by the U as colsurrij(T) = \{i:U < j < U + ri}\. 

Partial realizations. Let X = (t p , . . . ,t q ), where (p,q) 7^ (1, ?7z) and k G 
\p,q], be a vector in which ti G [l,m — r* + 1], for each i. We call column j 
unsaturated if colsurrij(X) < Cj, where colsurrij(X) = \{i:U < j < U +Tj}|. 
Let ax be the first unsaturated column and j3x the last. (We will omit the 
subscript X if X is understood from context.) X is called a [p, q]-realization 
(or a partial realization) if it satisfies condition (VC) for i = p, . . . ,q, and 

min {t p , t g } < a < (3 < max {t p + r p , t q + r q }, 
colsurrij(X) = Cj for j ^ [a, 0\. 

If max {t p , tq} < a and /3 < min {t p + r p , t q + r q }, then X is called balanced. 
Note that in the definition of partial realizations we do not insist that 

colsumj(X) < Cj for j — a + 1, . . . , (5 — 1. (1) 

We call X valid if it satisfies (1). Clearly, an invalid partial realization cannot 
be extended to a realization, but to facilitate a linear-time implementation we 
will verify (1) only for balanced partial realizations. 

Our algorithm will attempt to construct a realization row by row in the order 
of decreasing sums. A [p, g]-realization X will be extended to row p — 1 or 
q + 1, whichever has larger sum. Each X has at most two extensions, and it 
may have two extensions only if it is balanced. The naive approach would be to 
explore recursively all extensions, but, if the number of balanced realizations 
is large, it could result in exponential running time. To reduce the number 
of partial realizations to explore we show, using Lemma 4, that if (r, c) has 
any hv-convex realization at all, then each valid balanced [p, ^-realization 
X can be extended to a realization of (r,c). Therefore when we encounter 
a valid balanced [p, ^-realization, we can discard all other [p, ^-realizations. 
Consequently, we keep at most two partial realizations at each step. 

Suppose that either q = m or r p _x > r q+ i, and let lext p _i(X), rext p _ 1 (X) be 
the extensions of X with t P -\ = a and t p -\ = j3 — r v _\ + 1, respectively. If 
X is non-balanced, or balanced and valid, Ext v ^\ A (X) is the set containing 
these vectors in {lextp-i(X), rext p -i(X)} which are [p — 1, ^-realizations. If X 
is balanced but not valid then Extp-i yq (X) = 0. Analogously we define the set 
Ext Pt g + i(X) for the case when p = 1 or r q+1 > r p _i. 
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Algorithm 2 (See Fig. 2) 

Input: r G N m , c G N n 

W.l.o.g. assume: Vi : Tj G [1, n], Vj : c, G [1, m], £}, r « = Sj c j an d 3/c : r fc = n. 

t fc <- 1, X «- (t fc ), P <- {^}, (p, <?) <- (A;, fc), 
While (p, q) ^ (l,w) do begin 

If g = m or r p _i > r g+ i then p <— p — 1 else q <— q + 1 

If 3X G P such that X is valid and balanced 

then P <- Ext M (X) 

else P <- UxeP Ext PA (X) 
end 
If P contains a realization T then output T else output "failure" . 



Fig. 2. Execution of Algorithm 2 on r = (1, 4, 5, 3, 1) and c = (2, 4, 4, 2, 2). 
The correctness of Algorithm 2 follows from the following lemma. 

Lemma 5 Suppose that (r, c) has an hv-convex realization. Then, after each 
step of Algorithm 2, |P| < 2 and there is X G P that can be extended to an 
hv-convex realization of (r, c) . 



Proof The proof is by induction on q — p + 1. Denote by P Pi9 the set P at 
the beginning of the while loop. The lemma holds trivially for p = q = k. 
Assume that it holds for some P P)? . Without loss of generality, assume that 
either q 



m or r p _! > r q+ \. 



Suppose first that P p<q contains a valid balanced [p, q] -realization X. Let / = 
[P,Q}, J = [(*x,Px\ and R = I x J. Then £ ie/ r; = Y^j^J c j + l J l " 1^1- % 
Lemma 4, we get that for each realization T = (z%, . . . ,z m ), ROT and 
R 1 - fl T = 0. If we replace z p , . . . , z q by t p , . . . , t q , we get another realization T' 
that is an extension of X. Since q = m or r p _x > r g+1 , z p _x must be either ax 
or /5x — fp-i + 1- Therefore T" is an extension of one of rextp-i(X), lext p ^i(X) . 
Since P p -i, g = Ext p ^i tq (X), the lemma holds for P p -i, g . 



If P P)9 does not contain a valid balanced [p, g]-realization then, for X G P 



p,qi 
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either ax < m&x{t p ,t q } or fix > min {t p + r p , t q + r q }. Thus each X G P Pi9 
gives rise to at most one extension in P p _ 1(? , so we have |P p _i, 9 | < 2. Pick 
X G P Pi q that can be extended to a realization T = (t±, . . . ,t m ). We can 
assume ax < max{£ p , t q }, since the other case is symmetric. If t p < ax < t q , 
then t p -i = ax, and T is an extension of lext p -\{X) G P p _i i(? . If t q < ax < t p , 
then t q+ i = ax, and thus, by £ p _i > t q +i, we have £ p _i — fix — f' v -\ + 1- 
Therefore T is an extension of rext p _i(X) G P p _i ig . □ 

Linear-time implementation. A naive implementation of Algorithm 2 
takes time 0(mn). We show how to reduce the running time to 0(m + n). 

The computation can be divided into phases, where each phase starts when a 
valid balanced partial realization Y is encountered. During a phase, for each 
X G P, we store only rows which are not in Y, and when the phase ends 
the new rows are copied into Y. We also maintain the indices a = ax and 
fi = fix- Call a column j active if j G [a,/3]. To keep track of the sums of 
the active columns, we use two arrays hisurrij and losurrij which represent, 
respectively, the column sums of rows [p, k — 1] and [k + 1, q] in X. Only the 
values of hisurrij for j G [ax,t p — 1] U [t p + r p , fix] are stored explicitly. For 
j = [t p , t p + r p — 1], we know that hisurrij = k — p — 1. When t p increases or 
t p + r p decreases, we simply fill the new entries with the correct values. The 
array losurrij is maintained analogously. In this way, we can maintain these 
arrays in total time Oirri + n) and we can query colsurrij in time 0(1), since 
colsurrij = hisurrij + 1 + losurrij. 

It remains to show that condition (1) can be tested in total time Oirri + n). 
We keep the list A of the active columns j in the order of increasing sums. 
Let £ be the first column in A, that is q = mhijg^^] Cj. Then for a balanced 
realization X, (1) holds iff q— p + 1 < q, and thus each verification of (1) takes 
time 0(1). Initially, A can be constructed in time Oirn + n) by bucket-sort. 
Each active column has a pointer to its record in A. When we increase a or 
decrease fi, we simply remove from A the columns that become non-active, at 
cost 0(1) per deletion. 

In summary, we obtain the following result. 

Theorem 6 Algorithm 2 solves the reconstruction problem for centered hv- 
convex polyominoes, and it can be implemented in time 0(m + n). 



4 Final Comments 



To speed up Algorithm 1 further, one can explore the fact that the consecu- 
tive expressions F k> i(r, c) differ only very slightly. Thus, an efficient dynamic 
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algorithm for 2SAT could be used to improve the time complexity. The 2SAT 
problem is closely related to strong connectivity in directed graphs, for which 
no efficient dynamic algorithms are known. Nevertheless, it may be possible 
to take advantage of the special structure of the 2SAT instances arising in our 
algorithm, as well as the fact that the sequence of clause insertions/deletions 
in Fk,i(r, c) is known in advance. 

It would be interesting to investigate polyominoes which are digital images of 
convex shapes. Call a polyomino T convex if its convex hull does not contain 
any whole cells outside T. How fast can we reconstruct convex polyominoes? 
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